Derivational Morphology
نویسندگان
چکیده
Machine Translation (MT) systems that process unrestricted text should be able to deal with words that are not found in the MT lexicon. Without some kind of recognition, the parse may be incomplete, there is no transfer for the unfound word, and tests for transfers for surrounding words will often fail, resulting in poor translation. Interestingly, not much has been published on unfoundword guessing in the context of MT although such work has been going on for other applications. In our work on the IBM MT system, we implemented a far-reaching strategy for recognizing unfound words based on rules of word formation and for generating transfers. What distinguishes our approach from others is the use of semantic and syntactic features for both analysis and transfer, a scoring system to assign levels of confidence to possible word structures, and the creation of transfers in the transformation component. We also successfully applied rules of derivational morphological analysis to non-derived unfound words.
منابع مشابه
Rule-based versus associative processes in derivational morphology.
The present article examines whether derivational morphology shows evidence of an associative memory structure. A distributional analysis of stems of attested derivational forms revealed evidence of clustering around phonological properties (gangs) for all nonneutral affixes but only a few neutral affixes. Subjects' acceptability ratings for novel complex words revealed sensitivity to the gang ...
متن کاملDErivBase: Inducing and Evaluating a Derivational Morphology Resource for German
Derivational models are still an underresearched area in computational morphology. Even for German, a rather resourcerich language, there is a lack of largecoverage derivational knowledge. This paper describes a rule-based framework for inducing derivational families (i.e., clusters of lemmas in derivational relationships) and its application to create a highcoverage German resource, DERIVBASE,...
متن کاملThe Form and Content of Derivational Relations
Morpheme based theories of word structure encourage us to think of derived words (e.g. \innatable") as built by adding one complex of phonological, syntactic and semantic content (/{bl j /, Adjective, `able to be verbed') to another (/innejt/, Verb | +NP]], `inflate'). This model lacks generality in a number of ways, however. (a) Derivational relationships may not involve the simple addition of...
متن کاملParadigm Completion for Derivational Morphology
The generation of complex derived word forms has been an overlooked problem in NLP; we fill this gap by applying neural sequence-to-sequence models to the task. We overview the theoretical motivation for a paradigmatic treatment of derivational morphology, and introduce the task of derivational paradigm completion as a parallel to inflectional paradigm completion. State-of-the-art neural models...
متن کاملHindi Derivational Morphological Analyzer
Hindi is an Indian language which is relatively rich in morphology. A few morphological analyzers of this language have been developed. However, they give only inflectional analysis of the language. In this paper, we present our Hindi derivational morphological analyzer. Our algorithm upgrades an existing inflectional analyzer to a derivational analyzer and primarily achieves two goals. First, ...
متن کاملTowards a Malay Derivational Lexicon: Learning Affixes Using Expectation Maximization
We propose an unsupervised training method to guide the learning of Malay derivational morphology from a set of morphological segmentations produced by a naı̈ve morphological analyzer. Using a morphology-based language model, we first estimate the probability of a given segmentation. We train the model with EM to find the segmentation that maximizes the probability of each morpheme. We extract t...
متن کامل